Search CORE

1,326 research outputs found

Observation and Numerical Simulation of Terrain-Induced Windshear at the Hong Kong International Airport in a Planetary Boundary Layer without Temperature Inversions

Author: K. K. Hon
P. W. Chan
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2016
Field of study

Terrain-induced windshear at Hong Kong International Airport (HKIA) could be hazardous to the landing and departing aircraft. Such windshear occurring in a planetary boundary layer without temperature inversions is studied in this paper by using the data from the Terminal Doppler Weather Radar and Light Detection and Ranging systems. A high resolution numerical model, called aviation model (AVM), is also employed to find out its capability to forecast the occurrence of such windshear. The model is found to have skills in capturing the terrain-induced windshear, including the terrain-induced microburst due to the mountains of Lantau Island. Moreover, the windshear detection algorithm as applied to the AVM output, called AVM-GLYGA, is able to give advance alert for the occurrence of low-level windshear. The model also offers new dataset, such as vertical velocity and vertical cross sections across the windshear feature, to study the terrain-induced windshear phenomena with new insights. The AVM is found to have good skills in depicting the terrain-disrupted airflow at the airport area, and more comprehensive study would be conducted to study the skills of AVM-GLYGA as compared with pilot windshear report as sky truth

Crossref

Directory of Open Access Journals

Succinct Dictionary Matching With No Slowdown

Author: A.V. Aho
J.I. Munro
K. Sadakane
P. Elias
R.M. Fano
S. Dori
W.-K. Hon
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

The problem of dictionary matching is a classical problem in string matching: given a set S of d strings of total length n characters over an (not necessarily constant) alphabet of size sigma, build a data structure so that we can match in a any text T all occurrences of strings belonging to S. The classical solution for this problem is the Aho-Corasick automaton which finds all occ occurrences in a text T in time O(|T| + occ) using a data structure that occupies O(m log m) bits of space where m <= n + 1 is the number of states in the automaton. In this paper we show that the Aho-Corasick automaton can be represented in just m(log sigma + O(1)) + O(d log(n/d)) bits of space while still maintaining the ability to answer to queries in O(|T| + occ) time. To the best of our knowledge, the currently fastest succinct data structure for the dictionary matching problem uses space O(n log sigma) while answering queries in O(|T|log log n + occ) time. In this paper we also show how the space occupancy can be reduced to m(H0 + O(1)) + O(d log(n/d)) where H0 is the empirical entropy of the characters appearing in the trie representation of the set S, provided that sigma < m^epsilon for any constant 0 < epsilon < 1. The query time remains unchanged.Comment: Corrected typos and other minor error

arXiv.org e-Print Archive

CiteSeerX

Crossref

Compressed Data Structures for Dynamic Sequences

Author: A. Gupta
D. Belazzougui
G. Manzini
G. Navarro
H.-L. Chan
J. Barbay
J. Jansson
L. Arge
M. He
R. Grossi
S. Lee
S. Lee
V. Mäkinen
W.-K. Hon
W.-K. Hon
Publication venue
Publication date: 24/07/2015
Field of study

We consider the problem of storing a dynamic string

S

over an alphabet

\Sigma=\{\,1,\ldots,\sigma\,\}

in compressed form. Our representation supports insertions and deletions of symbols and answers three fundamental queries:

\mathrm{access}(i,S)

returns the

i

-th symbol in

S

\mathrm{rank}_a(i,S)

counts how many times a symbol

a

occurs among the first

i

positions in

S

, and

\mathrm{select}_a(i,S)

finds the position where a symbol

a

occurs for the

i

-th time. We present the first fully-dynamic data structure for arbitrarily large alphabets that achieves optimal query times for all three operations and supports updates with worst-case time guarantees. Ours is also the first fully-dynamic data structure that needs only

nH_k+o(n\log\sigma)

bits, where

H_k

is the

k

-th order entropy and

n

is the string length. Moreover our representation supports extraction of a substring

S[i..i+\ell]

in optimal

O(\log n/\log\log n + \ell/\log_{\sigma}n)

time

arXiv.org e-Print Archive

CiteSeerX

Crossref

Periodically-Poled Silicon [Updated]

Author: Bahram Jalali
Boyd R. W.
Daniel R. Solli
Dolgova T. V.
Kevin K. Tsia
Nick K. Hon
Publication venue: 'AIP Publishing'
Publication date: 01/01/2009
Field of study

We propose a new class of photonic devices based on periodic stress fields in silicon that enable second-order nonlinearity as well as quasi-phase matching. Periodically-poled silicon (PePSi) adds the periodic poling capability to silicon photonics, and allows the excellent crystal quality and advanced manufacturing capabilities of silicon to be harnessed for devices based on second-order nonlinear effects. As an example of the utility of the PePSi technology, we present simulations showing that mid-wave infrared radiation can be efficiently generated through difference frequency generation from near-infrared with a conversion efficiency of 50%. This technology can also be implemented with piezoelectric material, which offers the capability to dynamically control the X(2) nonlinearity.Comment: 11 pages, 4 figure

arXiv.org e-Print Archive

Crossref

HKU Scholars Hub

Large-vocabulary speaker-independent continuous speech recognition with semi-continuous hidden Markov models

Author: H. W. Hon
K. F. Lee
X. D. Huang
Publication venue
Publication date: 01/01/1989
Field of study

A semi-continuous hidden Markov model based on the muluple vector quantization codebooks is used here for large.vocabulary speaker-independent continuous speech recognition in the techn,ques employed here. the semi-continuous output probab~hty densHy function for each codebook is represented by a comhinat,on of the corre,~ponding discrete output probablhttes of the hidden Markov model end the continuous Gauss,an den. stay functions of each individual codebook. Parameters of vec. tor qusnttzation codebook and hidden Markov model are mutuully optimized to achJeve an optimal model'codebook comb,nation under a untried probab,hshc framework Another advantages of thts approach is the enhanced robustness of the semi. continuous output probability by the combination of multiple codewords and multtple codebooks For a 1000.word speakermdependen

CiteSeerX

Crossref

Bankruptcy Law

Author: Gaffey David W.
Sieg K. Elizabeth
Tice Hon. Douglas O., Jr.
Publication venue: UR Scholarship Repository
Publication date: 01/11/2012
Field of study

University of Richmond

An Efficient Alignment Algorithm for Searching Simple Pseudoknots over Long Genomic Sequence

Author: Hon W
Lam TW
Ma CCC
Sadakane K
Wong KF
Yiu SM
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012
Field of study

published_or_final_versio

HKU Scholars Hub

Weathering of Radial and Tangential Wood Surfaces of Pine and Spruce

Author: Arnold M.
Borgin K.
Borgin K.
Boutelje J.
Browne F. L.
Browne F. L.
Derbyshire H.
Desai R. L.
Feist W. C.
Feist W. C.
Hon D. N.
Hon D. N.
Hon D. N.
Hon D. N.
Kalnins M. A.
Kenaga D. L.
Kringstad K.
Sell J.
Publication venue: 'Walter de Gruyter GmbH'
Publication date
Field of study

Crossref

Low Space External Memory Construction of the Succinct Permuted Longest Common Prefix Array

Author: D Okanohara
J Fischer
J Fischer
J Fischer
J Kärkkäinen
J Kärkkäinen
J Kärkkäinen
J Sirén
JI Munro
JS Vitter
K Sadakane
K Sadakane
P Ferragina
P Ferragina
P Ferragina
R Dementiev
T Beller
T Kasai
U Manber
W Hon
W Szpankowski
Publication venue
Publication date: 01/01/2016
Field of study

The longest common prefix (LCP) array is a versatile auxiliary data structure in indexed string matching. It can be used to speed up searching using the suffix array (SA) and provides an implicit representation of the topology of an underlying suffix tree. The LCP array of a string of length

n

can be represented as an array of length

n

words, or, in the presence of the SA, as a bit vector of

2n

bits plus asymptotically negligible support data structures. External memory construction algorithms for the LCP array have been proposed, but those proposed so far have a space requirement of

O(n)

words (i.e.

O(n \log n)

bits) in external memory. This space requirement is in some practical cases prohibitively expensive. We present an external memory algorithm for constructing the

2n

bit version of the LCP array which uses

O(n \log \sigma)

bits of additional space in external memory when given a (compressed) BWT with alphabet size

\sigma

and a sampled inverse suffix array at sampling rate

O(\log n)

. This is often a significant space gain in practice where

\sigma

is usually much smaller than

n

or even constant. We also consider the case of computing succinct LCP arrays for circular strings

arXiv.org e-Print Archive

Crossref

MPG.PuRe

Efficient Representation of Multidimensional Data over Hierarchical Domains

Author: H Samet
K Sadakane
M Levene
NR Brisaboa
NR Brisaboa
NR Brisaboa
R Kimball
S Chaudhuri
W Hon
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 21/09/2016
Field of study

The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-46049-9_19[Abstract] We consider the problem of representing multidimensional data where the domain of each dimension is organized hierarchically, and the queries require summary information at a different node in the hierarchy of each dimension. This is the typical case of OLAP databases. A basic approach is to represent each hierarchy as a one-dimensional line and recast the queries as multidimensional range queries. This approach can be implemented compactly by generalizing to more dimensions the k2k2 -treap, a compact representation of two-dimensional points that allows for efficient summarization queries along generic ranges. Instead, we propose a more flexible generalization, which instead of a generic quadtree-like partition of the space, follows the domain hierarchies across each dimension to organize the partitioning. The resulting structure is much more efficient than a generic multidimensional structure, since queries are resolved by aggregating much fewer nodes of the tree.Ministerio de Economía, Industria y Competitividad; TIN2013-46238-C4-3-RMinisterio de Economía, Industria y Competitividad; IDI-20141259Ministerio de Economía, Industria y Competitividad; ITC-20151305Ministerio de Economía y Competitividad; ITC-20151247Xunta de Galicia; GRC2013/053Chile.Fondo Nacional de Desarrollo Científico y Tecnológico; 1-140796COST. IC130

arXiv.org e-Print Archive

Repositorio da Universidade da Coruña

Crossref